An Evolutionary Gene Selection Method for Microarray Data Based on SVM Error Bound Theories
نویسندگان
چکیده
Microarrays have thousands to tens of thousands of gene features but patient samples are fewer or a few hundred. Identifying genes whose disruption causes congenital or acquired disease is the fundamental problem in microarray data analysis. In this paper, we propose an efficient evolutionary SVM-based classifier that can select smaller number of features with high accuracy. The proposed method uses SVM with a given subset of features to evaluate the fitness function, and new subset of features are selected based on several leave-one-out error bounds for the SVM classifier and the frequency of occurrence of the features in the evolutionary approach. We test our proposed method on different microarray data and find that the proposed method can obtain high classification accuracy with a smaller number of selected genes.
منابع مشابه
A Comparison of SVM-based Evolutionary Methods for Multicategory Cancer Diagnosis using Microarray Gene Expression Data
Selection of relevant genes that will give higher accuracy for sample classification (for example, to distinguish cancerous from normal tissues) is a common task in most microarray data studies. An evolutionary method based on generalization error bound theory of support vector machine (SVM) can select a subset of potentially informative genes for SVM classifier very efficiently. The bound theo...
متن کاملA Comparison of SVM-based Criteria in Evolutionary Method for Gene Selection and Classification of Microarray Data
An evolutionary method whose selection and recombination operations are based on generalization error-bounds of support vector machine (SVM) can select a subset of potentially informative genes for SVM classifier very efficiently [7]. In this paper, we will use the derivative of error-bound (first-order criteria) to select and recombine gene features in the evolutionary process, and compare the...
متن کاملAn evolutionary approach for gene selection and classification of microarray data based on SVM error-bound theories
Microarrays have thousands to tens-of-thousands of gene features, but only a few hundred patient samples are available. The fundamental problem in microarray data analysis is identifying genes whose disruption causes congenital or acquired disease in humans. In this paper, we propose a new evolutionary method that can efficiently select a subset of potentially informative genes for support vect...
متن کاملGene Identification from Microarray Data for Diagnosis of Acute Myeloid and Lymphoblastic Leukemia Using a Sparse Gene Selection Method
Background: Microarray experiments can simultaneously determine the expression of thousands of genes. Identification of potential genes from microarray data for diagnosis of cancer is important. This study aimed to identify genes for the diagnosis of acute myeloid and lymphoblastic leukemia using a sparse feature selection method. Materials and Methods: In this descriptive study, the expressio...
متن کاملFeature Selection and Classification of Microarray Gene Expression Data of Ovarian Carcinoma Patients using Weighted Voting Support Vector Machine
We can reach by DNA microarray gene expression to such wealth of information with thousands of variables (genes). Analysis of this information can show genetic reasons of disease and tumor differences. In this study we try to reduce high-dimensional data by statistical method to select valuable genes with high impact as biomarkers and then classify ovarian tumor based on gene expression data of...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009